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ABSTRACT 

Recent years have seen increased interest in the online pres- 
ence of extreme right groups. Although originally com- 
posed of dedicated websites, the online extreme right milieu 
now spans multiple networks, including popular social media 
platforms such as Twitter, Facebook and YouTube. Ideally 
therefore, any contemporary analysis of online extreme right 
activity requires the consideration of multiple data sources, 
rather than being restricted to a single platform. We in- 
vestigate the potential for Twitter to act as a gateway to 
communities within the wider online network of the extreme 
right, given its facility for the dissemination of content. A 
strategy for representing heterogeneous network data with 
a single homogeneous network for the purpose of commu- 
nity detection is presented, where these inherently dynamic 
communities are tracked over time. We use this strategy 
to discover and analyze persistent English and German lan- 
guage extreme right communities. 
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1. INTRODUCTION 

The online presence of the extreme right has come under 
greater scrutiny in recent years, particularly given events 
such as the Oslo killings by Anders Behring Breivik in July 



2011, Wade Michael Page's attack on a Sikh temple in Wis- 
consin in August 2012 and the uncovering in November 2011 
of the German National Socialist Underground (NSU)'s multi- 
year murder campaign. While initially the extreme right re- 
lied upon dedicated websites for the dissemination of their 
hate content [t] , they have since incorporated the use of so- 
cial media platforms for community formation around vari- 
ants of extreme right ideology [l5]. As Twitter's facility for 
the dissemination of content has been established [2], our 



previous work 28 provided an exploratory analysis of its 
use by extreme right groups. Here we extend this analysis 
to investigate the potential for Twitter to act as a gateway to 
communities located within the online extreme right milieu, 
which spans multiple platforms such as Twitter, Facebook 
and YouTube in addition to other websites. Our analysis is 
focused on two case studies, involving the investigation of 
English and German language online communities. 

We infer network representations of heterogeneous nodes 
and edges to capture the relations between four different ex- 
treme right online entities: (a) Twitter accounts, (b) You- 
Tube channels, (c) Facebook profiles and (d) all other web- 
sites. The inclusion of such platforms allows us to iden- 
tify extreme right communities that would otherwise not 
be apparent when considering Twitter data alone. As on- 
line extreme right networks are inherently dynamic, we are 
also interested in tracking the evolution of the associated 
groups over an extended period of time. Following work by 
Lehmann et al. [24], we apply the community finding pro- 
cess to a single network representation where all node types 
are treated as peers. When interpreting the output of the 
process, the use of heterogeneous data provides us with a 
rich insight into the composition of the communities, and 
serves to differentiate between them. 

In Section [2j we provide a description of related work on 
the online activities of extremist groups, along with discus- 
sion of relevant network analysis methods. The generation of 
two data sets using English and German language tweets is 
then discussed in Section [S] Next, in Section [Ij we describe 
the methodology used to derive network representations and 
subsequently track detected communities. This is followed 
by an analysis of selected persistent dynamic communities 
found in both data sets, where we discuss the merits of the 
network representation and provide characterizations of the 



membership composition in terms of the extent to which 
they span multiple online platforms. Finally, the overall 
conclusions are discussed in Section |6j and some suggestions 
for future work are made. 

2. RELATED WORK 
2.1 Online Extremism 

A number of studies have investigated the online pres- 
ence of extreme right groups, with analysis of associated 
websites receiving substantial attention. An early example 
is that of Burris et al. [t], where social network analysis of 
the network generated from a selection of white supremacist 
websites found evidence of decentralization, with little di- 
vision along doctrinal lines. 



Gerstenfeld et al. 14 analyzed 
the content of various extremist websites, with notable find- 
ings including the discovery of unexpected links between 
white supremacist and Islamist websites, and the potential 
for forging a stronger sense of community between geograph- 
ically isolated groups. Chau and Xu \d\ studied networks 
built from users contributing to hate group and racist blogs, 
where similar decentralization to that of the Burris study 
was detected. Tateo [34] analyzed the website network of 
Italian extreme right groups, also studied by Caiani and 
Wagemann |i8" in addition to the German extreme right web- 
site network. 

More recent work has included the study of online so- 
cial media usage by the extreme right. Bartlett et al. |4] 
performed a survey of European populist party and group 
supporters on Facebook, where they found that these sup- 
porters were more likely to demonstrate than the national 
average, and were also motivated by positive identification 
with party values. Baldauf et al. [s] investigated the use of 
Facebook by the German extreme right. They provided a 
categorization of the active users and groups, explored the 
strategies employed for promoting associated ideology and 
discussed the use of certain themes designed to attract new 
unsuspecting followers, such as stated support for freedom 
of speech and opposition to child sex abuse. Goodwin and 
Ramalingam's [l6] analysis of the radical right in Europe 
discussed the importance of online social media, and sepa- 
rately found a shortage of research on non-electoral forms of 
right-wing extremism. 

A set of reports commissioned by the Swedish Ministry 
of Justice and the Institute for Strategic Dialogue 15 dis- 
cussed the use of social media such as Facebook and You- 
Tube by a number of European extreme right groups. Fi- 
nally, in his investigation of the role of the Internet in in- 
dividuals' radicalization, Kohler [20] used grounded theory 
methodology to analyze interviews conducted with former 
members of the German extreme right, from which he con- 
cluded that online activity is influential as a socialization 
institution. These related studies suggest that communi- 
ties within the online extreme right milieu span multiple 
networks, which provides a motivation for the current work 
to analyze their structure and temporal persistence, while 
also investigating the potential for popular social media plat- 
forms such as Twitter to act as a gateway to this activity. 

2.2 Network Analysis 

An appropriate network representation is required in order 
to analyze the multi-network online activity of the extreme 
right. The inference and relevance problems discussed by De 



Choudhary et al. [12] are applicable here, where the former 
is attributed to the fact that "real" social ties are not directly 
observable and must be inferred from observations of events 
(in our case, tweets from identified extreme right accounts 
on Twitter), while the lack of one "true" social network is 
associated with the latter, due to the potential existence of 
multiple networks each corresponding to a different defini- 
tion of a tie. Separate incompleteness issues also exist given 
the nature of the extreme right. These issues are similar to 
those encountered by Sparrow |3lj and Krebs [21] in their 
respective studies of criminal and terrorist networks, where 
references are made to the inevitability of missing nodes and 
links, the difficulty in deciding who to include and who not 
to include, and the dynamic quality of the networks. As we 
are interested in the potential for Twitter to act as a gateway 
to online extreme right activity, the notion of missing nodes 
and links is realized with the unavailability of older tweet 
data, relevant Twitter accounts that have been suspended, 
profiles that are not publicly accessible and extreme right 
entities that do not maintain a presence on Twitter. 

Our proposed representation of heterogeneous node types 
with a single homogeneous network is also motivated in part 
by previous work in cluster analysis and community find- 
ing, where nodes with different semantics have been treated 
equally during the clustering phase, and subsequently treated 
separately to support the interpretation of the resulting clus- 
ters. For instance, Dhillon [Ts] produced a bipartite spectral 
embedding of words and documents, which allowed both to 
be clustered simultaneously. Lehmann et al. [24] proposed 
an extension of the well-known clique percolation method 
for community finding to identify communities of nodes in 
bipartite graphs. The use of multiple node types avoids the 
loss of important structural information, which can occur 
when performing a one-mode projection of a graph. 

For the analysis of communities in temporal networks, a 



variety of approaches have been proposed. Greene et al. 18 
described a general model for tracking communities in dy- 
namic networks. Communities are identified on each in- 
dividual time step network, which provides a snapshot of 
the data at a particular interval. These communities are 
then matched together to form timelines, representing long- 
lived dynamic communities that exist over multiple succes- 
sive steps. This work was extended in [l7] to address the 
problem of clustering dynamic bipartite networks, where a 
clustering is produced on a spectral embedding of both types 
of nodes simultaneously at each time step before matching. 

3. DATA 

In our previous analysis [28], we collected a number of 
Twitter data sets, each associated with a particular coun- 
try, to facilitate the analysis of contemporary activity by 
extreme right groups. One of our findings was the influ- 
ence of linguistic proximity on relationships within the de- 
tected extreme right communities, in particular, the use of 
the English language. Given this, we merged two of our 
data sets associated with the USA and the United Kingdom 
respectively. This data set, along with that associated with 
German extreme right Twitter accounts, are the starting 
points for this current work. Both data sets were extended 
to include additional accounts that were considered relevant, 
where relevance is determined by criteria such as proflles 
containing extreme right symbols or references to known 
groups; follower relationships with known relevant accounts; 



similar Facebook profiles/YouTube channels; extreme right 
media accounts; a propensity to share links to known ex- 
treme right websites. Similar criteria have been employed 
by other researchers [sj. In total, 1,267 English language 
and 430 German language accounts were identified as being 
relevant. Profile data including followers, friends, tweets and 
list memberships were retrieved for these accounts, as lim- 
ited by the current Twitter API restrictions, between June 
2012 and November 2012. 

We are interested in the potential for Twitter to act as 
a gateway to online extreme right activity through the dis- 
semination of links to external websites; for example, dedi- 
cated websites managed by particular groups, content shar- 
ing websites such as YouTube, or mainstream websites host- 
ing content that could be used to promote associated con- 
cerns. Accordingly, the account tweet content is analyzed to 
extract all external URLs. YouTube URLs are further pro- 
cessed, where profile data is retrieved for any detected video 
and channel (account) identifiers. Similarly, profile data is 
retrieved for any user, group and page identifiers detected 
within Facebook URLs. In both cases, we are restricted to 
data that is publicly accessible. We also extract Twitter ac- 
count identifiers from any URLs that point to content such 
as photos or other images that are hosted directly on Twit- 
ter. Regarding interactions within Twitter itself, we extract 
all mention and retweet events between the identified ac- 
counts. Statistics of the final data sets used in the analysis 
can be found in Table [l] 

4. METHODOLOGY 

4.1 Network Representation 

In this work, we use a single network representation con- 
sisting of four node types: (a) Twitter accounts, (b) You- 
Tube channels, (c) Facebook profiles and (d) other websites, 
where we refer to the latter three as external nodes (with 
respect to Twitter). Heterogeneous edges are also employed 
to refiect the variety of possible interactions; (a) a Twitter 
account mentioning another Twitter account, (b) a Twitter 
account retweeting another Twitter account and (c) a Twit- 
ter account linking to an external node through the inclusion 
of a corresponding URL in a tweet. For the purpose of this 
analysis, no distinction is made between mention or retweet 
events. In a similar approach to the ingredient complement 
network used by Teng et al. [35], we infer edges between 
external nodes whose URLs have appeared in tweets from 
the same Twitter account, in order to further tackle the 
issue of data incompleteness. At this point, we filter any 
YouTube and website nodes whose URLs have appeared in 
tweets from a single account only. As the number of unique 
profiles found in tweets containing Facebook URLs tends to 
be lower, these nodes are not filtered. We do not filter exter- 
nal nodes that could be considered as mainstream (for exam- 
ple, newspapers or TV channels), as we are also interested 
in references to such entities by extreme right accounts. All 
edges are treated as undirected, with raw weights based on 
the frequency of the edge event in question. These weights 
are normalized with the use of pointwise mutual information 
(PMI) as suggested by Teng et al. [35] , where we extend this 
process for heterogeneous node pairs (a, b) as follows: 



For Twitter-Twitter mentions|retweets edges, the PMI is the 
probability that two Twitter accounts a and b have a men- 
tion | retweet relationship against the probability that their 
respective mentions|retweets are separate: 



p{a,b) = 



p{a) = 



# of mentions I retweets between a and 6 
# of mentions I retweets 

# of mentions! retweets featuring a 
# of mcntions|retwccts 

# of mentions j retweets featuring b 



'^^ ^ # of mentions I retweets 

For Twitter-external URL edges, the PMI is the probability 
that a URL external entity b has been tweeted by a par- 
ticular Twitter account a against the probability that their 
respective URL tweet occurrences are separate: 

^ of tweets from a with URL of b 
# of tweets with URLs 



p{a,b) 



p{a) 



# of tweets from a with URLs 



p(fe) = 



# of tweets with URLs 

^ of tweets with URL of 6 

# of tweets with URLs 



For inferred external-external edges, the PMI is the prob- 
ability that two URL external entities o and b have been 
tweeted by a particular Twitter account against the proba- 
bility that they have been tweeted by separate accounts: 



p{a,b) = 



# of accounts that tweeted URLs of both a and b 



p{a) = 



p{b) 



# of accounts that tweeted URLs 

# of accounts that tweeted URL of a 
^ of accounts that tweeted URLs 

# of accounts that tweeted URL of b 



# of accounts that tweeted URLs 

The addition of inferred edges between external nodes can 
occasionally lead to dense networks, particularly with the 
inclusion of accounts that are prolific tweeters. As the PMI 
weights for these edges tend to be normally distributed, we 
filter any such edges with PMI < ^ -I- 2a. All other edges 
are retained. 

4.2 Community Detection 

As with our previous analysis ^28j, we use our variant 
[19| of the Lancichinetti and Fortunate consensus clustering 
method [22] to generate a set of stable consensus communi- 
ties from an inferred network instance, based on 100 runs of 
the OSLOM algorithm 23 . As before, we selected a value 



PMI(a,b) = log 



p{a)p{b)) 



of 0.5 for the threshold parameter r used with the consen- 
sus method, as a compromise between node retention and 
more stable communities. Although we previously detected 
communities within a static graph representation, these net- 
works are inherently dynamic, and so we are interested in 
the topology of these communities over an extended period 
of time. Using the method of Greene et al. [Ts] , we repre- 
sent this dynamic network as a set of I time step networks 
{g\, . . . ,gi}, employing a sliding window approach to gener- 
ate snapshots of the nodes and edges in the overall network 
at successive intervals. 



Data set 


Tweets 


Mentions 


Retweets 


All URLs 


YouTube URLs 


Facebook URLs 


English language 


1,517,339 


539,181 


162,042 


972,444 


71,049 


23,007 


German language 


54,873 


14,263 


11,028 


88,309 


3,868 


6,924 



Table 1: Data set statistics for the period of June 1st, 2012 to November 16th, 2012. 



The selection of the sliding window size has a direct im- 
pact on the topology of the generated network snapshots. 
In their discussion of dynamic proximity networks, Clauset 
and Eagle ^lOj claim that dynamics are a multi-scale phe- 
nomenon, with variation taking place at several distinct time 
scales. However, they also suggest that a natural time scale 
exists with which important temporal variation is preserved. 
In this analysis, we consider this natural time scale to be re- 
lated to the activity of the identified extreme right Twitter 
accounts. We calculated the mean percentage of accounts 
active in each data set within a series of increasing time 
scales ranging from one day up to a duration of eight weeks; 
a plot of these values can be seen in Figure [l] Although the 
accounts in the English language data set seem to be more 
active than those of the German language data set, the rel- 
ative activity trends are highly similar. In both cases, there 
is a sharp increase in the percentage of active accounts be- 
tween the daily and weekly time scales, with a lesser increase 
between the weekly and bi-weekly scales. At this point, the 
rate of increase slows down, which suggests that using a win- 
dow size of two weeks is appropriate for meaningful analysis 
rather than a larger size whereby the corresponding snap- 
shot networks become increasingly static. When using non- 
overlapping sliding windows, we found that the generated 
step networks exhibited a certain amount of volatility, po- 
tentially due to data incompleteness or a variance in Twitter 
usage patterns in different countries, for example, the use of 
Twitter tends to be more prevalent in the UK and the USA 
than in other countries such as Germany. We use overlap- 
ping sliding windows in an attempt to smooth this volatility, 
where the overlap is a period of one week. 

Another factor that can influence volatility is related to 
those nodes that are present in the step networks on an 
intermittent basis. For example, an offline extreme right 




Figure 1: Mean percentage of Twitter accounts active at 
increasing time scales (1 day up to 8 weeks) for English 
and German language data sets, between June 2012 and 
November 2012. 



event may lead to increased activity from certain Twitter 
accounts or references to external media nodes. This volatil- 
ity impacts the Greene et al. [Ts] method, where a dynamic 
community timeline is created by matching step communi- 
ties to existing dynamic communities over successive steps. 
To address this problem, we extend the method as follows: 

1. Rather than solely relying on the most recent observa- 
tion (referred to as the front) in an existing dynamic 
community timeline for comparison with the set of step 
communities at each step, we consider all historical 
step communities in a timeline as comparison candi- 
dates. This permits the consideration of those nodes 
that are intermittently active. 

2. We use a modified version of the Jaccard coefficient for 
binary sets to calculate the similarity between a step 
community Cta and a single candidate historical step 
community Dtb belonging to an existing dynamic com- 
munity timeline, based on the undirected cluster rep- 
resentativeness metric suggested by Bourqui et al. [6] : 



sim{Cta,Dtb) = 



Cta n Dtl 



\CtanDt, 

\Dtb\ 



3. The overall similarity of a step community with the 
entire set of candidate communities belonging to an 
existing dynamic community timeline is calculated as 
an exponential weighted moving average of the indi- 
vidual similarities, with a decay factor a = 0.5. A 
match occurs if the overall similarity > 0.25. 

5. ANALYSIS 

5.1 Persistent Dynamic Communities 

The method described in the previous section was applied 
to our two case study data sets, on data collected from June 
1st, 2012 to November 16th, 2012. Using a sliding window 
length of two weeks with a one week overlap resulted in 
twenty-three time steps. A network representation was de- 
rived for each of these steps, and the dynamic community 
matching process was then applied to these step networks. 
In our analysis, we are particularly interested in persistent 
dynamic communities, where a community is defined as per- 
sistent if it has members active in all time steps. The total 
number of such dynamic communities found in the English 
language and German language data sets were fifteen and 
four respectively. We have selected two persistent commu- 
nities from each data set for detailed analysis, and timeline 
diagrams of the constituent step communities can be found 
in Figures [2a| and [25] 

From an initial inspection of these timeline diagrams, it 
can be seen that these dynamic communities exhibit a cer- 
tain amount of volatility, with numerous merge and split 
life-cycle events occurring. As in 18 , a merge occurs if mul- 



tiple existing dynamic communities are matched to a single 



01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23 




(a) EDL (top) and BNP (bottom) communities, containing an average of 14.9% of the nodes in each step network (a = 2%) 
01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23 



(b) German non-electoral (top) and NPD (bottom) communities, containing an average of 50% of the nodes in each step network (cr = 7%) 

Figure 2: Timelines of selected persistent dynamic communities for the (a) English and (b) German language datasets, from 
June 2012 (step t = 1) to mid-November 2012 (step t — 23). Each step corresponds to a two week window, with one week 
overlaps. White rectangles indicate that the associated dynamic community was not matched in the corresponding step. 



step community at time t. A split occurs if a single dy- 
namic community is matched to multiple step communities 
at time t. On occasion, we have found that an existing dy- 
namic community may be a candidate for both a merge and 
a split event at a particular time t. In this scenario, we give 
precedence to merge events and a new dynamic community 
is created for the step community that generates the split. 
Both timeline diagrams also include branches of step com- 
munities that are distinct upon initial creation, but become 
merged in a subsequent step. 

An analysis of the nodes in the first English language com- 
munity in Figure [2a] shows that it is associated with the 
English Defence League (EDL), a street protest movement 
opposed to the alleged spread of radical Islamism within the 
UK. This community also contains nodes associated with 
the British Freedom party, a splinter group from the British 
National Party (BNP) that formed an alliance with the 
EDL in 2012 |16[ |26| . We observe the presence of Casuals 
United, a protest group formed from an alliance of football 
hooligans, also linked with the EDL. The second English 
language community is primarily associated with the BNP, 
with nodes from other groups such as Combined ex-Forces 
(CxF), Infidels and The British Resistance also present [15] . 
Other notable persistent dynamic communities from this 
data set include two white power/national socialist com- 
munities largely consisting of North American and South 
African nodes respectively. 

The first of the selected German language communities 
shown in Figure |2b| appears to be associated with a vari- 



ety of non-electoral groups. These include auflerparlamen- 
tanscher Widerstand (non-parliamentary resistance) enti- 
ties, Freies Netz (neo-Nazi collectives), along with various 
"information/news portals" [27]. Two organizations banned 
by the German authorities in 2012, namely Spreelichter [SO] 
and Besseres Hannover [33], are also present. The second 
German language community is electoral in nature, asso- 
ciated with the Nationaldemokratische Partei Deutschlands 
(National Democratic Party of Germany - NPD), and in- 
cludes nodes representing regional NPD offices and individ- 
ual politicians. Although this community appears to be as- 
sociated with regions across Germany, a smaller persistent 
NPD community, localized to the federal state (Bundesland) 
of Thiiringen can also be observed. This smaller commu- 
nity contains a mixture of electoral (NPD) and non-electoral 
nodes. 

5.2 Community Structure 

In this section, we discuss the merits of using a single 
network representation of heterogeneous node types in our 
analysis of extreme right communities. As the online pres- 
ence of these groups extends beyond individual networks 
such as Twitter or Facebook, the proposed representation 
permits the structure of these wider communities to be re- 
vealed, which would otherwise not be evident if analysis was 
restricted to a single network. The representation also sup- 
ports our objective of studying the potential for Twitter to 
act as a gateway to online extreme right activity. 

We illustrate this by visualizing individual step commu- 



(a) All nodes 



(b) Twitter nodes 



(a) All nodes 



(b) Twitter nodes 



Figure 3: BNP community. Filtering results in central node 
loss and disconnection (grey=Twitter, blue=Facebook, or- 
ange=YouTube, red=other websites). 



Figure 5: German non-electoral community. Filtering re- 
sults in central node loss (grey=Twitter, orange=YouTube, 
red=other websites). 



nities belonging to the persistent dynamic communities se- 
lected from the data sets, as described in the previous sec- 
tion. Network diagrams of these communities were created 
with Gephi 5', using the Yifan Hu layout. Figurejslcontains 
diagrams for a BNP step community, with Figure [3a| present- 
ing the community with all nodes, while non-Twitter nodes 
have been filtered in Figure |3b| (42% of nodes and 28% of 
edges are retained), resulting in the loss of important cen- 
tral nodes such as the official BNP website and YouTube 
channel. The contrast between Figures [3a| and [Sb] illustrates 
the important linking role played by non-Twitter nodes, as 
the observable network is disconnected when they are not 
considered. In addition, the use of heterogeneous nodes en- 
ables us to immediately identify this community as being 
associated with the BNP. Similar diagrams for an EDL step 
community can be seen in Figure [4] with 66% of nodes and 
56% of edges retained with filtering (Figure 4b I. Although 



disconnection is not introduced, it nevertheless results in the 
loss of central nodes such as the official EDL and British 
Freedom websites. 

We also present examples from the selected German lan- 
guage communities. The filtering of the non-electoral com- 



munity in Figure [5] (58% of nodes and 46% of ed ges are 
retained) produces a similar effect to that of the EDL com- 
munity, where important central nodes such as the Besseres 
Hannover website and those associated with other relevant 
information portals are removed. Our last example in Fig- 
ure |6] demonstrates the potential for severe loss of commu- 
nity structure when filtering is applied (32% of nodes and 3% 
of edges are retained). In this example, although there is al- 
most no communication (in terms of mentions and retweets) 
between the Twitter nodes, they are considered members of 
the same community within the wider network. 
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(b) Twitter nodes 



• •• . 



• , • • • • 

• ••.V."* • • • 

• • • • ' . 

• • • 



(a) All nodes 
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(b) Twitter nodes 



Figure 4: EDL community. Filtering results in central node 
loss (grey=Twitter, blue=Facebook, orange= YouTube, 
red=other websites). 



Figure 6: NPD community. Filtering results in central node 
loss and disconnection (grey=Twitter, blue=Facebook, or- 
ange^ YouTube, red=other websites). 

5.3 Community Characterization 

We now provide a characterization of the selected persis- 
tent dynamic communities shown in Figure[2] with a detailed 
analysis of the member nodes. To address our objective of 
Twitter's potential to act as a gateway to the wider online 
network of the extreme right, we pay particular attention 
to the external website nodes. Due to the sensitivity of the 
subject matter, and in the interests of privacy, individual 
accounts and profiles from networks such as Twitter, Face- 
book and YouTube are not explicitly identified. Instead, we 
restrict discussion to known extreme right groups and their 
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(b) Normalized degree 

Figure 7: EDL community characterization, based on two 
alternative rankings of website nodes. 



affiliates. For each of the selected communities, we provide 
two alternative top ten rankings of the website nodes; the 
first ranks the nodes in terms of their frequency of step mem- 
bership, while the second ranks them in terms of their total 
degree across all steps, normalized with respect to the total 
number of steps. In both cases, the membership ranking 
distribution tends to be positively skewed, with a set of core 
community members assigned in the majority of steps, ac- 
companied by a larger set of peripheral members who were 
assigned intermittently. 

Figure [7] presents the two rankings of website nodes for 
the EDL community. As might be expected, the frequency 
ranking includes the official EDL and British Freedom web- 
sites. Other websites and blogs affiliated with the EDL are 
also present, including a Casuals United blog. Of similar 
interest is a blog that appears to be a mocking reference 
to the established anti-extremist organization, HOPE not 
hate. The degree ranking produces broadly similar results, 
with notable exceptions being the appearance of mainstream 
media websites. An analysis of the original URLs finds them 
to be associated with populist newspaper articles on topics 
such as the existence of UK sex grooming gangs, Muslim 
integration, and immigration in general. Separate studies 
have found interest in these topics by other extreme right 
groups [3] , and it would appear that mainstream media may 
unwittingly play a role in the occasional promotion of ma- 
terial that coincides with extreme right ideology. Other ar- 
ticles that are interpreted as perceived media persecution 
of groups such as the EDL are also promoted. Looking at 
the non-website nodes, official EDL Twitter and Facebook 
accounts can be observed, along with those purporting to 
be affiliates. Similar YouTube channels are also present, in- 
cluding a British Freedom channel that was suspended in 
late 2012 (this has since been replaced with a new channel 
according to their website). 
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Figure 8: BNP community characterization, based on two 
alternative rankings of website nodes. 



We find similar membership composition in the BNP com- 
munity, where the website rankings are presented in Fig- 
ure [S] The frequency ranking highlights websites associated 
with the BNP and that of its current leader, Nick Griffin. 
As mentioned in an earlier section, this community also con- 
tains members associated with The British Resistance, a 
white nationalist group whose website banner contains the 
14-word slogan coined by the American white supremacist 
David Lane ( "We must secure the existence of our people and 
a future for White Children"). As with the EDL community, 
mainstream media websites appear in the degree ranking, 
where the corresponding article topics are similar to those 
promoted by the EDL. The presence of additional groups 
such as Combined ex-Forces (CxF) and the Infidels can be 
found when the Twitter and Facebook nodes are inspected. 
The latter group originally splintered from the EDL, and al- 
though it has a minor presence within the EDL community, 
it would seem that the BNP's efforts to court this group 
[25| may partly explain its more significant role in this com- 
munity. The most prominent YouTube node is the official 
BNP channel, with other channels of a nationalist nature 
appearing sporadically. 

In the case of the German language website rankings. Fig- 
ure |9] shows blogs and websites from a variety of disparate 
groups for the non-electoral community. The majority of 
these are associated with known non-electoral groups, al- 
though mainstream media websites are also present. An 
analysis of the associated articles demonstrates once again 
the promotion of perceived persecution, for example, the 
banning of the Spreehchter and Besseres Hannover groups 
features heavily here. We also see a connection to electoral 
groups with the appearance of the NPD-affiliated MUPInfo 
website. Further connections to the NPD are visible as var- 
ious NPD Facebook page nodes are also intermittent mem- 
bers of this community. A notable temporary Facebook 



identitaet.tumblr.com 
infoportal-sclnwaben.net 
pinselstriche.org 
freies-netz-sued.net 
aitermedia-deutsctiland.info 
mupinfo.de 
senftenberger.biogspot.com 
besseres-fiannover.info 
rufirnacfiricfiten.de 
widerstand.info 



III 

5 10 15 


1 

20 


1 

25 


(a) Membership frequency 






besseres-hannover.info I I 



identitaet.tumbir.com I I 

widerstand.info I I 

aitermedia-deutscfiland.info I I 

mupinfo.de I I 

pinseistriche.org | | 

spiegei.de I I 

freies-netz-sued.net I I 

senftenberger.biogspot.com I I 

infoportai-schwaben.net I I 

I 1 1 1 1 1 

2 4 6 8 10 

(b) Normalized degree 

Figure 9: German non-electoral community characteriza- 
tion, based on two alternative rankings of website nodes. 

member node was a page (no longer available) encourag- 
ing "solidarity" with Nadja Drygalla, a member of the 2012 
German Olympic rowing team who left the tournament early 
due to the neo-Nazi connections of her boyfriend, who was 
a former NPD election candidate [321. The website rankings 
for the NPD community in Figure |10| are primarily com- 
posed of NPD-afEliated websites, with a minor number of 
mainstream websites also appearing for the same reasons 
as before. Both communities feature a moderate YouTube 
presence, with nodes such as NPD channels, music channels 
and individual channels featuring videos of demonstrations, 
including some of the Unsterblichen (immortals) marches 
that were orchestrated by Spreehchter 3^ 30 . 

5.4 Twitter Activity and External Events 

We also perform a brief inspection of community Twitter 
activity, providing examples for a single persistent dynamic 
community from both data sets. Based on knowledge of ex- 
ternal offline events associated with these communities, we 
generate three month plots of the daily total tweet counts 
for (a) the accounts assigned in the corresponding two-week 
step community and (b) the remaining accounts in the data 
set, where the raw counts are z-score normalized. In both 
cases, we find that peaks in tweeting activity by the commu- 
nity accounts may be associated with external events. For 
example, the EDL activity plot in Figure \TTa\ highlights in- 
creased activity around the times of street demonstrations, 
with other events such as the anniversary of the July 7th, 
2005 London bombings being of particular interest to this 
community. Although the German non-electoral community 
plot in Figure [llb| also contains raised activity for a demon- 
stration, other events such as the Besseres Hannover ban 
(initially banned by the German authorities [33]; their ac- 
count was subsequently blocked by Twitter within Germany 
have a similar impact. These peaks may introduce tem- 
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Figure 10: NPD community characterization, based on two 
alternative rankings of website nodes. 



porary community members, for example, a file-sharing site 
containing an archive of the besseres-hannover.info website 
appears for a number of days at the time of the initial ban. 

5.5 Summary Discussion 

In general, the membership composition of the selected 
communities would appear to broadly correspond with con- 
temporary knowledge of these groups; for example, the dis- 
tinction between the EDL and the BNP [25]. To a certain 
extent, we also see divisions according to the four-fold typol- 
ogy suggested by Goodwin et al. [T6| , where these communi- 
ties could be characterized by three of the four types; orga- 
nized political parties (BNP, NPD), grassroots social move- 
ments (EDL) and smaller groups (German non-electoral). 
However, it could also be suggested that some overlap be- 
tween these types can occur, for example, the presence of 
the Infidels and British Resistance in the BNP community, 
or NPD-affiliated nodes in the German non-electoral com- 
munity. In addition, the finding of Bartlett et al. J4| that 
online supporters of groups such as the EDL are more likely 
to demonstrate than the national average may partly ex- 
plain the increased levels of community Twitter activity at 
the time of external protests. 

We also note the use of official Facebook profiles and 
YouTube channels by electoral parties and grassroots move- 
ments in addition to those maintained by related individu- 
als, while general blogging websites (often hosted in other 
countries) appear popular with most groups. Separately, all 
groups appear to selectively reference mainstream media, 
directing traffic to material that coincides with associated 
ideology. At this point, we also emphasize that the network 
representations used in this analysis do not provide cover- 
age of all online extreme right activity. As mentioned ear- 
lier, data collection from Twitter, YouTube and Facebook 
was restricted to publicly accessible content. Given existing 
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(b) German non-electoral community tweet activity from August to November 2012. 
Figure 11: Tweet activity plots of communities, produced via z-score normalization of tweet counts, for selected time periods. 



knowledge of the use of online platforms such as Facebook by 
the extreme right [sj^j, a certain level of incompleteness is 
to be expected. In addition, the fact that we use Twitter as 
a gateway to infer structure within the wider online network 
of the extreme right may also introduce incompleteness. 

6. CONCLUSIONS AND FUTURE WORK 

The online activity of extreme right groups has progressed 
from the use of dedicated websites to span multiple net- 
works, including popular social media platforms such as 
Twitter, Facebook and YouTube. As its role in the general 
dissemination of content has previously been established, we 
have investigated the potential for Twitter to act as a gate- 
way to communities located within this wider network. By 
representing relations between the heterogeneous network 
entities with a single homogeneous network, we are able to 
identify extreme right communities that would otherwise not 
be evident when considering a single online network alone. 
The use of heterogeneous data provides us with a rich insight 
into the composition of the resulting communities. Our anal- 
ysis has focused on the investigation of English and German 
language communities using two separate data sets, where 
we have tracked community evolution in these inherently 
dynamic networks over an extended period of time. Two 
persistent communities found in each of the data sets were 
selected for detailed analysis. We have discussed the impact 
of heterogeneous nodes on community topology, and used 
these nodes to provide community characterizations, partic- 
ularly in terms of the extent to which these communities 
span multiple online platforms. 

In our dynamic community analysis of both data sets, 
we have found that the individual step networks tended to 



exhibit a considerable level of volatility. Although this may 
be due in part to data incompleteness, it is possible that 
such volatility may simply be a feature of the online extreme 
right presence. It is likely that this corresponds to factors 
such as the continual emergence of new groups [26], while 
events such as the jailing of the EDL's Tommy Robinson 
[29| or a potential ban of the NPD ]T] are also likely to 
impact volatility. We would like to address this issue in 
future work, which may involve an extension of the process 
currently used to track dynamic communities. In addition, 
it would be worthwhile to investigate the potential for other 
social media platforms such as Facebook to act Eis a gateway 
to online extreme right activity, where a comparison could 
be made with the current results. This should also help to 
address the issue of data incompleteness. 
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